Best explanation in this video made by myself.
In both cases we want to understand the derivatives as the velocity of curves in the same finite dimensional vector space . But the approach is different:
when we go from a point $p$ to a near point $p'$, being $\hat{T}_{p'}$ "what $T_{p'}$ should be", i.e., the pushforward of $T_p$ by means of the flow of $V$. See Lie derivative of vector fields#Personal interpretation.
We could write, informally, that
$$ T_{p'} \approx \widetilde{T}_{p'}+(\nabla_V T)_{p'} $$when we go from a point $p$ to a near point $p'$. The expression $\widetilde{T}_{p'}$ means what $T_{p'}$ should be, i.e., the parallel transport (which depends on the geometry) of $T_p$ along $V$.
Finally, since the torsion of a connection is the fail to close a parallelogram made of parallel transported vectors, we have the following computational relation between covariant derivatives and Lie derivatives:
$$ \mathcal{L}_V T= \nabla_V T- \nabla_T V-T (V, T) $$Also, most connections are torsion free, and in those cases we have:
$$ \mathcal{L}_V T=[V,T]= \nabla_V T- \nabla_T V $$________________________________________
________________________________________
________________________________________
Author of the notes: Antonio J. Pan-Collantes
INDEX: